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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: CHOO, Yen 

6 KLUG, Aaron 

7 SANCHEZ GARCIA, Isidro 

9 (ii) TITLE OF INVENTION: Improvements in or Relating to 

10 Binding Proteins for Recognition of DNA 

12 (iii) NUMBER OF SEQUENCES: 125 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Pillsbury Madison & Sutro, L.L.P. 

16 (B) STREET: 1100 New York Avenue, N.W. 

17 (C) CITY: Washington 

18 (D) STATE: D.C. 

19 (E) COUNTRY: USA 

20 (F) ZIP: 20005-3918 

22 (v) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Diskette 

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC -DOS /MS -DOS 

26 (D) SOFTWARE: Word Perfect 
2 8 (vi) CURRENT APPLICATION DATA: 

C--> 29 (A) APPLICATION NUMBER: US/10/033 , 12 9C 

C--> 30 (B) FILING DATE: 27-Dec-2001 

31 (C) CLASSIFICATION: 

48 (vii) PRIOR APPLICATION DATA: 

W--> 34 (A) APPLICATION NUMBER: US 08/793,408 

35 (B) FILING DATE: 02-JUN-1997 

W--> 37 (A) APPLICATION NUMBER: PCT/GB95/01949 

38 (B) FILING DATE: 17-AUG-1995 

W--> 41 (A) APPLICATION NUMBER: GB 9514698.1 

42 (B) FILING DATE: 18-JUL-1995 

W--> 45 (A) APPLICATION NUMBER: GB 9422534.9 

46 (B) FILING DATE: 08-NOV-1994 

W--> 4 9 (A) APPLICATION NUMBER: GB 9416880.4 

50 (B) FILING DATE: 20-AUG-1994 

52 (2) INFORMATION FOR SEQ ID NO: 1: 

54 (i) SEQUENCE CHARACTERISTICS: 

55 (A) LENGTH: 60 base pairs 

56 (B) TYPE: nucleic acid 

57 (C) STRANDEDNESS : single 

58 (D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



65 CTCCTGCAGT TGGACCTGTG CCATGGCCGG CTGGGCCGCA TAGAATGGAA 5 0 
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PATENT APPLICATION: US/10/033 , 129C TIME: 11:31:09 
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Output Set: N:\CRF4\08162005\J033129C.raw 

66 CAAC TAAAGC 6 0 

68 (2) INFORMATION FOR SEQ ID NO: 2: 

70 (i) SEQUENCE CHARACTERISTICS: 

C--> 71 (A) LENGTH: 92 amino acids 

72 (B) TYPE: amino acid 

73 (C) STRANDEDNESS : 

74 (D) TOPOLOGY: unknown 
76 (ii) MOLECULE TYPE: protein 

78 (ix) FEATURE: 

79 (A) NAME/KEY: variable amino acid residue 

80 (B) LOCATION: 57 

81 (C) IDENTIFICATION METHOD: by experiment 

82 (D) OTHER INFORMATION: Arg or Lys 
84 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

86 Met Ala Glu Glu Arg Pro Tyr Ala Cys Pro 

87 5 10 

88 Val Glu Ser Cys Asp Arg Arg Phe Ser Arg 

89 15 20 

90 Ser Asp Glu Leu Thr Arg His lie Arg lie 

91 25 30 

92 His Thr Gly Gin Lys Pro Phe Gin Cys Arg 

93 35 40 
W-~> 94 lie Cys Met Arg Asn Phe Ser Xaa Xaa Xaa 

95 45 50 

96 Xaa Leu Xaa Xaa His Xaa Xaa Thr His Thr 

97 55 60 

98 Gly Glu Lys Pro Phe Ala Cys Asp lie Cys 

99 65 70 

100 Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg 

101 75 80 

102 Lys Arg His Thr Lys lie His Leu Arg Gin 

103 85 90 

104 Lys Asp 

106 (2) INFORMATION FOR SEQ ID NO: 3: 

108 (i) SEQUENCE CHARACTERISTICS: 

109 (A) LENGTH: 26 base pairs 

110 (B) TYPE: nucleic acid 

111 (C) STRANDEDNESS: single 

112 (D) TOPOLOGY: linear 

114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

116 TATGACTTGG ATGGGAG AC C GCCTGG 2 6 

118 (2) INFORMATION FOR SEQ ID NO: 4: 
12 0 (i) SEQUENCE CHARACTERISTICS: 

121 (A) LENGTH: 28 base pairs 

122 (B) TYPE: nucleic acid 

123 (C) STRANDEDNESS: single 

124 (D) TOPOLOGY: linear 

126 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

128 AATTCCAGGC GGTCTCCCAT CCAAGTCA 2 8 
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13 0 (2) INFORMATION FOR SEQ ID NO: 5: 

132 (i) SEQUENCE CHARACTERISTICS: 

133 (A) LENGTH: 21 base pairs 

134 (B) TYPE: nucleic acid 

135 (C) STRANDEDNESS : single 

136 (D) TOPOLOGY: linear 

138 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

14 0 TATATAGCGT GGGCGTATAT A 21 
142 (2) INFORMATION FOR SEQ ID NO: 6: 

144 (i) SEQUENCE CHARACTERISTICS: 

145 (A) LENGTH: 24 base pairs 

146 (B) TYPE: nucleic acid 

147 (C) STRANDEDNESS: single 

14 8 (D) TOPOLOGY: linear 

150 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

152 GCGTATATAC GCCCACGCTA TATA 24 

154 (2) INFORMATION FOR SEQ ID NO: 7: 

156 (i) SEQUENCE CHARACTERISTICS: 

157 (A) LENGTH: 21 base pairs 

158 (B) TYPE: nucleic acid 

15 9 (C) STRANDEDNESS: single 
160 (D) TOPOLOGY: linear 

162 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

164 TATATAGCGN NNGCGTATAT A 21 

166 (2) INFORMATION FOR SEQ ID NO: 8: 

168 (i) SEQUENCE CHARACTERISTICS: 

169 (A) LENGTH: 24 base pairs 

170 (B) TYPE: nucleic acid 

171 (C) STRANDEDNESS: single 

172 (D) TOPOLOGY: linear 

174 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

176 GCGTATATAC GCNNNCGCTA TATA 24 

179 (2) INFORMATION FOR SEQ ID NO: 9: 

181 (i) SEQUENCE CHARACTERISTICS: 

182 (A) LENGTH: 33 base pairs 

183 (B) TYPE: nucleic acid 

184 (C) STRANDEDNESS: single 

185 (D) TOPOLOGY: linear 

187 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

189 TTCCATGGAG ACGCAGAAGC CCTTCAGCGG CCA 3 3 

191 (2) INFORMATION FOR SEQ ID NO: 10: 

193 (i) SEQUENCE CHARACTERISTICS: 

194 (A) LENGTH: 33 base pairs 

195 (B) TYPE: nucleic acid 

196 (C) STRANDEDNESS: single 

197 (D) TOPOLOGY: linear 

199 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

2 01 TTCCATGGAG ACGCAGGTGA GTTCCTCACG CCA 3 3 

203 (2) INFORMATION FOR SEQ ID NO: 11: 
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Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\08162005\J033129C.raw 

205 (i) SEQUENCE CHARACTERISTICS: 

206 (A) LENGTH: 33 base pairs 

207 (B) TYPE: nucleic acid 

208 (C) STRANDEDNESS: single 

209 (D) TOPOLOGY: linear 

211 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

213 CCCCTTTCTC TTCCAGAAGC CCTTCAGCGG CCA 3 3 

215 (2) INFORMATION FOR SEQ ID NO: 12: 

217 (i) SEQUENCE CHARACTERISTICS: 

218 (A) LENGTH: 33 amino acids 

219 (B) TYPE: amino acid 
22 0 (C) STRANDEDNESS: 

221 (D) TOPOLOGY: unknown 

223 (ii) MOLECULE TYPE: peptide 

225 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

22 7 Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 
228 5 10 

23 0 lie Cys Met Arg Asn Phe Ser Asp Arg Ser 
231 15 20 

233 Ser Leu Thr Arg His Thr Arg His Thr Gly 

234 25 30 
2 36 Glu Lys Pro 

239 (2) INFORMATION FOR SEQ ID NO: 13: 

241 (i) SEQUENCE CHARACTERISTICS: 

242 (A) LENGTH: 3 3 amino acids 

243 (B) TYPE: amino acid 

244 ( C ) STRANDEDNESS : 

245 (D) TOPOLOGY: unknown 
247 (ii) MOLECULE TYPE: peptide 

249 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

251 Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

252 5 10 

254 lie Cys Met Arg Asn Phe Ser Glu Arg Gly 

255 15 20 

257 Thr Leu Ala Arg His Glu Lys His Thr Gly 

258 25 30 
26 0 Glu Lys Pro 

262 (2) INFORMATION FOR SEQ ID NO: 14: 

264 (i) SEQUENCE CHARACTERISTICS: 

265 (A) LENGTH: 2 7 amino acids 

266 (B) TYPE: amino acid 

267 (C) STRANDEDNESS: 

268 (D) TOPOLOGY: linear 
2 70 (ii) MOLECULE TYPE: peptide 

272 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

2 74 Phe Gin Cys Arg lie Cys Met Arg Asn Phe 
275 5 10 

2 77 Ser Gin Gly Gly Asn Leu Val Arg His Leu 
278 15 20 



file://C:\CRF4\OUTHOLD\VsrJ033 1 29C.htm 



8/16/05 



Page 5 



RAW SEQUENCE LISTING DATE: 08/16/2005 

PATENT APPLICATION: US/10/033 , 129C TIME: 11:31:09 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\08162005\J033129C.raw 



280 Arg His Thr Gly Glu Lys Pro 

281 25 

283 (2) INFORMATION FOR SEQ ID NO: 15: 
2 85 (i) SEQUENCE CHARACTERISTICS: 

286 (A) LENGTH: 26 amino acids 

287 (B) TYPE: amino acid 

288 (C) STRANDEDNESS: 

289 (D) TOPOLOGY: unknown 
2 91 (ii) MOLECULE TYPE: peptide 

293 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

2 95 Phe Gin Cys Arg lie Cys Met Arg Asn Phe 
296 5 10 

2 99 Ser Gin Ala Gin Thr Leu Gin Arg His Leu 
300 15 20 

3 02 Lys His Thr Gly Glu Lys 
303 25 

305 (2) INFORMATION FOR SEQ ID NO: 16: 
307 (i) SEQUENCE CHARACTERISTICS: 

3 08 (A) LENGTH: 2 6 amino acids 

309 (B) TYPE: amino acid 

310 (C) STRANDEDNESS: 

311 (D) TOPOLOGY: unknown 
313 (ii) MOLECULE TYPE: peptide 

315 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

317 Phe Gin Cys Arg lie Cys Met Arg Asn Phe 

318 5 10 

320 Ser Gin Ala Ala Thr Leu Gin Arg His Leu 

321 15 20 

323 Lys His Thr Gly Glu Lys 

324 25 

32 6 (2) INFORMATION FOR SEQ ID NO: 17: 
328 (i) SEQUENCE CHARACTERISTICS: 

32 9 (A) LENGTH: 26 amino acids 

330 (B) TYPE: amino acid 

331 (C) STRANDEDNESS: 

3 32 (D) TOPOLOGY: unknown 

3 34 (ii) MOLECULE TYPE: peptide 

336 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

33 8 Phe Gin Cys Arg lie Cys Met Arg Asn Phe 
339 5 10 

341 Ser Gin Ala Gin Asp Leu Gin Arg His Leu 

342 15 20 

344 Lys His Thr Gly Glu Lys 

345 25 

347 (2) INFORMATION FOR SEQ ID NO: 18: 

34 9 (i) SEQUENCE CHARACTERISTICS: 

350 (A) LENGTH: 89 amino acids 

351 (B) TYPE: amino acid 

352 (C) STRANDEDNESS: 
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RAW SEQUENCE LISTING DATE : 08/15/2005 

PATENT APPLICATION: US/10/033 , 129C TIME: 08:35:39 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\08152005\J033129C.raw 

SEQ0EN cb™ eh/"'*- -V 

(1) GENERAL INFORMATION: - v V 



5 


(i) 


APPLICANT: CHOO, Yen 




6 




KLUG # Aaron 


7 




SANCHEZ GARCIA, Isidro 




9 


(ii) 


TITLE OF INVENTION: Improvements in or Relating to 


10 




Binding Proteins for Recognition of DNA 


12 


(iii) 


NUMBER OF SEQUENCES: 125 




14 


(iv) 


CORRESPONDENCE ADDRESS: 




15 




(A) ADDRESSEE: Pillsbury Madison & Sutro, 


L.L.P. 


16 




(B) STREET: 1100 New York Avenue, N.W. 




17 




(C) CITY: Washington 




18 




(D) STATE: D.C. 




19 




(E) COUNTRY: USA 




20 




(F) ZIP: 20005-3918 


22 


(v) 


COMPUTER READABLE FORM: 


23 




(A) MEDIUM TYPE: Diskette 


24 




(B) COMPUTER: IBM PC compatible 


25 




(C) OPERATING SYSTEM: PC -DOS /MS -DOS 




26 




(D) SOFTWARE: Word Perfect 




28 


(vi) 


CURRENT APPLICATION DATA: 




C--> 29 




(A) APPLICATION NUMBER: US/10/033 , 12 9C 




C--> 30 




(B) FILING DATE: 27-Dec-2001 




31 




(C) CLASSIFICATION: 


45 


(vii) 


PRIOR APPLICATION DATA: 


34 




(A) APPLICATION NUMBER: PCT/GB95/01949 


35 




(B) FILING DATE: 17-AUG-1995 




38 




(A) APPLICATION NUMBER: GB 9514698.1 




39 




(B) FILING DATE: 18-JUL-1995 




42 




(A) APPLICATION NUMBER: GB 9422534.9 




43 




(B) FILING DATE: 08-NOV-1994 




46 




(A) APPLICATION NUMBER: GB 9416880.4 




47 




(B) FILING DATE: 20-AUG-1994 





ERRORED SEQUENCES 

323 (2) INFORMATION FOR SEQ ID NO: 17: 

325 (i) SEQUENCE CHARACTERISTICS: 

326 (A) LENGTH: 26 amino acids 
32 7 (B) TYPE: amino acid 

328 ( C ) STRANDEDNESS : 

329 (D) TOPOLOGY: unknown 



> 
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■ RAW SEQUENCE LISTING DATE : 08/15/2005 

PATENT APPLICATION: US/10/033 , 129C TIME: 08:35:39 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\08152005\J033129C.raw 
331 (ii) MOLECULE TYPE: peptide 

333 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

335 Phe Gin Cys Arg lie Cys Met Arg Asn Phe 

336 5 10 

338 Ser Gin Ala Gin Asp Leu Gin Arg His Leu 

339 15 20 
341 Lys His Thr Gly Glu Lys 

E--> 342 2S~ ^ 
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PATENT APPLICATION: US/10/033 , 12 9C TIME: 08:35:41 



Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\08152005\J033129C.raw 

L:29 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:30 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:68 M:220 C: Keyword misspelled or invalid format, [(A) LENGTH:] 
L:91 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:40 
M:341 Repeated in SeqNo=2 

L:342 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
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BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 
(ETblack borders 

□ image cut off at top, bottom or sides 

□ faded text or drawing 

□ blurred or illegible text or drawing 

□ skewed/slanted images 

□ color or black and white photographs 

□ gray scale documents 

□ lines or marks on original document 

□ reference(s) or exhibit(s) submitted are poor quality 

□ OTHER: ; . 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



